U.S. S.N 10/043,356 
LaPointe et ai 

PRELIMINARY AMENDMENT 

REMARKS 

Any fees that may be due in connection with this application throughout 
its pendency may be charged to Deposit Account No. 50-1213. 

Claims 1-6 are pending. Claim 5 is amended to correct a minor 
typographical, error and to produce grammatical clarity. No new matter has 
been added. 

The specification is amended to correct obvious typographical and 
spelling errors and to produce grammatical clarity. The amendment to the 
paragraph on page 1, tines 3-23, of the specification adds the inadvertently 
omitted phrase "now abandoned" to provide the current status of the cited 
applications. The amendment to the paragraph on page 24, lines 9-14, of the 
specification deletes the second occurrence of the phrase "used as 41 for 
grammatical clarity. The amendment to the paragraph on page 27, line 23, 
through page 28, line 5, of the specification adds the inadvertently omitted verb 
"described" for grammatical clarity. The amendment to the paragraphs on page 
31, lines 1 1-30, of the specification adds the inadvertently omitted phrase n Fig. 
3A" to the cited steps for clarity. The amendment finds basis in Figure 3A. The 
amendment to the paragraphs on page 35, line 14, through page 36, line 1 7, of 
the specification adds the inadvertently omitted phrase "Fig. 3A" to the cited 
steps for clarity. The amendment finds basis in Figure 3A. The amendment to 
the paragraph on page 37, lines 16-29, of the specification adds the 
inadvertently omitted phrase "Fig. 3A" to the cited steps for clarity. The 
amendment finds basis in Figure 3A. The amendment to the paragraphs on 
page 38, line 9, through page 40, line 2, of the specification adds the 
inadvertently omitted phrases "Fig. 3A" and "Fig. 3B" to the cited steps for 
clarity. The amendment finds basis in Figures 3A and 3B. The amendment to 
' the paragraph on page 50, linelO, through page 51, line 24, of the 
specification adds the inadvertently omitted verb "described" for grammatical 
clarity. The amendment to the paragraph on page 87, lines 4-18, of the 
specification adds the inadvertently omitted number "5" for grammatical clarity. 

-26- 



PAGE 55/94 * RCVD AT 8/24/2005 9:56:11 PM [Eastern Daylight Time] ^ SVR:USPT(«FXRF-6/24 ^ DNB:2733687 * CSID: * DURATION (mm-ss):2W2 



U.S.S.N 10/043,356 
LaPointe et at. 

PRELIMINARY AMENDMENT 

The amendment finds basis at page 87, lines 26-27, of the specification. No 
new matter has been added. 

The abstract is amended to correct a minor typographical error. No new 
matter has been added. 

Included as an attachment is a marked-up version of the specification 
paragraphs that are being amended, per 37 CFR 51.121. 



Entry of this amendment and examination of the application are 
respectfully requested. 
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APPLICATIONS 



Art Unit: 
Examiner: 



2122 

Unassigned 



ATTACHMENT TO THE PRELIMINARY AMENDMENT 



MARKED UP PARAGRAPHS AND CLAIMS (37 CFR 9 1.1 21) 

IN THE SPECIFICATION: 

Please amend the specification as follows: 
Please amend the paragraph on page 1, lines 3-23, as follows: 
This application is a continuation of allowed U.S. application Serial No. 
08/912,133, filed August 14, 1997, to Jerome Lapointe and Duane DeSieno 
entitled "METHOD FOR SELECTING MEDICAL AND BIOCHEMICAL 
DIAGNOSTIC TESTS USING NEURAL NETWORK-RELATED APPLICATIONS." 
This application is a continuation-in-part of U.S. application Serial 08/798,306, 
filed February 7, 1997 entitled "METHOD FOR SELECTING MEDICAL AND 
BIOCHEMICAL DIAGNOSTIC TESTS USING NEURAL NETWORK-RELATED 
APPLICATIONS" to Jerome Lapointe and Duane DeSieno . now abandoned . This 
application is also a continuation-in-part of U.S. application Serial 08/599,275, 
filed February 9, 1996, to Jerome Lapointe and Duane DeSieno, entitled 
-METHOD FOR DEVELOPING MEDICAL AND BIOCHEMICAL DIAGNOSTIC 
TESTS USING. NEURAL f NETWORKS. "1 NETW0RKS." now abandoned. U.S. 
application Serial No. 08/798,306 is a continuation-in-part of U.S. application 
Serial No. 08/599,275. This application and U.S. application Serial No. 
08/798,306 claim the benefit of priority under 35 U.S.C. §1 1 9(e) to U.S. 
provisional application Serial No. 60/011,449, entitled "METHOD AND 
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APPARATUS FOR AIDING IN THE DIAGNOSIS OF ENDOMETRIOSIS USING A 
PLURALITY OF PARAMETERS SUITED FOR ANALYSIS THROUGH A NEURAL 
NETWORK" to Jerome Lapointe and Duane DeSieno, filed February 9, 1996. 

Please amend the paragraph beginning on page 6. line 27, through page 
7. line 25, as follows: 

Because of the predictive and intuitive nature of medical diagnosis, 
attempts have been made to develop neural networks and other expert systems 
that aid in this process. The application of neural networks to medical diagnosis 
has been reported. For example, neural networks have been used to aid in the 
diagnosis of cardiovascular disorders (see, e.g., Baxt (1991) "Use of an Artificial 
Neural Network for the Diagnosis of Myocardial Infarction," Annals of Internal 
Medicine 775:843; Baxt (1992) "Improving the Accuracy of an Artificial Neural 
Network Using Multiple Differently Trained Networks," Neural Computation 
4:772; Baxt (1992), "Analysis of the clinical variables that drive decision In an 
artificial neural network trained to identify the presence of myocardial 
infarction," Annals of Emergency Medicine 27:1439; and Baxt (1994) 
"Complexity, chaos and human physiology: the justification for non-linear neural 
computational analysis," Cancer Letters 77:85). Other medical diagnostic 
applications include the use of neural networks for cancer diagnosis (see ; e.g. 
Maclin, etal. [(19910)H99ll "Using Neural Networks to Diagnose Cancer" 
Journal of Medical Systems 75:11-9; Rogers, etal. (1994) "Artificial Neural 
Networks for Early Detection and Diagnosis of Cancer" Cancar Letters 77:79- 
83; Wilding, etal. (1994) "Application of Backpropogation Neurai Networks to 
Diagnosis of Breast and Ovarian Cancer" Cancer Letters 77:145-53), 
neuromuscular disorders (Pattichis, etal. (1995) "Neural Network Models in 
EMG Diagnosis", IEEE Transactions on Biomedical Engineering 42:5:486-495), 
and chronic fatigue syndrome (Solms, etal. (1996) "A Neural Network 
Diagnostic Tool for the Chronic Fatigue Syndrome", International Conference on 
Neural Networks, Paper No. 108). These methodologies, however, fail to 
address significant issues relating to the development of practical diagnostic 
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tests for a wide range of conditions and do not address the selection of input 
variables. 

Please amend the paragraph on page 12, lines 9-21, as follows: 
The resulting systems are used as an aid in diagnosis. In addition, as the 
systems are used, patient data can be stored and then used to further train the 
systems and to develop systems that are adapted for a particular genetic 
population. This inputting of additional data into the system may be 
implemented automatically or done manually. By doing so the systems 
continually learn and adapt to the particular environment in which they are used. 
The resulting systems have numerous uses in addition to diagnosis, which 
includes assessing the severity of a disease or disorder, and predicting the 
outcome of a selected treatment protocol. The systems may also be used to 
assess the value of other data in a diagnostic procedure, such as biochemical 
test data and other such data, and to identify new tests that are useful for 
diagnosing a particular disease. 

Please amend the paragraph on page 13, lines 8-16, as follows: 
The methods are exemplified with reference to neural 
tnetworksjn^srks: however, it is understood that other data mining tools 
such as expert systems, fuzzy logic, decision trees, and other statistical 
decision-support systems which are generally non-linear, may be used 
Although the variables provided herein are intended to be used with decision- 
support systems, once the variables are identified, then a person, typically a 
Phys.can, armed with knowledge of the important variables can use them to aid 
-n d.agnosis in the absence of a decision-support system or using a .ess complex 
linear system of analysis. 

Please amend the paragraph on page 18, lines 16-20. as follows: 
Diagnostic software and exemplary neural networks that use the variables 
for diagnosis of endometriosis and the risk of delivery before a specified time 
are also provided. Software thaj generates a clinically useful endometriosis 
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index is provided as software that generates an index for assessing the risk are 
provided. 

Please amend the paragraph on page 20, lines 22-23. as follows- 
FIGURES 1 6A and 1 6B [showsjshow exemplary outputs from the 
rsoftwarej software; FIGURE 16B includes the input data as well; 

Please amend the paragraph on page 24, lines 9-14, as follows: 
As used herein, partition means to select a portion of the data, such as 
80%, and use it for training a neural net and to use the remaining portion as test 
data. Thus, the network is trained on all but one portion of the data The 
process can then be repeated and a second network trained. The process is 
repeated until all partitions are used as [used as] test data and training data. 
Please amend the paragraph on page 26, lines 18-23, as follows- 
As used herein, performance of a system is said to be improved or higher 
when the results more accurately predict or determine a particular outcome. It is 
also to be understood that the performance of a system will typica.ly be better 

as more training examples are used. Thus, the systems herein will improve over 
•me a s they are used and more ^ ^ ^ ^ ^ 

to the systems as training data. 

«. »»?ZZ1? para9raph ^ on paae 27 - " ne * « - 

As shown herein, neural network may also reveal that certain input 
factors tha, were no, initially considered ,o be important can influence an 
outcome as we,, as reyea, mat presumeb,y important factors are no, outcome 
de,erm,na„y.. The abiihy of neura, ne,w.*e , 0 reyea, m e re,eyan, end 

rraieyan, inpu, ,ac,ors permh me h fluldins the dsslgn of § ^ 

Aa shown herein, n.ural. networks, and o,her such de,a minlno ,oo,s. are e 
ye,ueb,e adyance in di. 8 „os,ios. provid,n 9 en 0 pp Ort uni,y ,o increase me 

ZT ^ r Cm0i,V " 3 d ' a9n0S,i0 ,45, • A ' Sh ° W " — ™* b. 

faken ,o ayo.d the po,en,ia, of poor-aecuracy enswer due ,o ,h. phenomenon of 
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local minima. The methods scribed herein provide a means to avoid this 
problem or at least minimize it. 

Please amend the paragraphs on page 31, lines 11-30, as follows: 

Isolation of important or relevant variables -ranking the variables 
[FiguesS 3A-BjFiqures 3A-3B provide a flow chart of the process for 
isolating the important or relevant variables (Fi g . 3A f Step .E) within a diagnostic 
test. Such a process is typically conducted using a digital computer system to 
which potentially relevant information has been provided. This procedure ranks 
the variables in order of importance using two independent methods, then.se- 
lects a subset of the available variables from the uppermost of the ranking. As 
noted above, other ranking methods can be used by those of skill in the art in 
Place of chi square or sensitivity analysis. Also, if x is set to N (the total 
number of candidate variables), then ranking can be arbitrary. 

The system trains a plurality of neural networks on the available data (Fig, 
3A, Step I), as explained hereinafter, then generates a sensitivity analysis over 
all trained networks to determine to what extent each input variable was used in 
the network to perform the diagnosis (Fig. 3A f Step J,. A consensus sensitivity 
analysis of each input variable is determined by averaging the individual 
sensitivity analysis results for each of the networks trained. Based upon 
sensitivity, a ranking order for each of the variables available from the patient 
history information is determined ( Fig. 3ft f Step K). 

Please amend the paragraphs beginning on page 35, line 14, through 
page 36, line 17, as follows: 

Chi-square contingency table 

When dealing with sparse binary data, a positive response on a given 

variable might be highly correlated to the condition being diagnosed, but occur 

so rnfrequently in the training data that the importance of the variable as 

■ndicated by the neural network sensitivity analysis, might be very low. In order 

to catch these occurrences, the Chi-square contingency table is used as a 

secondary ranking process. A 2X2 contingency table Chi-square test on the 
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binary variables, where each cell of the table is the observed frequency for the 
combination of the two variables (Fig. 3A, Step F) is performed. A 2X2 
contingency table Chi-square test is performed on the continuous variables using 
optimal thresholds {which might be empirically-determined) ( Fig. 3A. Step G)> 
The binary and continuous variables that have been based on Chi-square 
analysis are ranked ( Fio. 3A. Step H). 

The standard Chi-square 2X2 contingency table operative on the binary 
variables ( Fiq. 3A , Step F) is used to determine the significance of the 
relationship between a specific binary input variable and the desired output (as 
determined by comparing the training data with the known single output result). 
Variables that have a low Chi-square value are typically unrelated to the desired 
output. 

For variables that have continuous values, a 2X2 contingency table can 
be constructed (FJg^A, Step G) by comparing the continuous variable to a 
threshold value. The threshold value is modified experimentally to yield the 
highest possible Chi-square value. 

The Chi-square values of the continuous variables and of the binary 
variables can then be combined for common ranking ( Fig. 3A. Step H). A 
second level of ranking can then be performed that combines the Chi-square- 
ranked variables with the sensitivity-analysis-ranked variables ( Fig. 3A. Step L). 
This combining of rankings allows variables that are significantly related to the 
output but that are sparse (Ae, values that are positive or negative in only a 
small percentage of cases) to be included in the set of important variables. 
Otherwise, important information in such a non-linear system could easily be 
overlooked. 

Please amend the paragraph on page 37, lines 16-29, as follows: 
In particular, the second level ranking process ( Fio. 3A. Step L) starts by 
add,ng the highest ranked variable from the sensitivity analysis (Step K) to the 
set of important variables (fig^A, Step H). Alternatively, the second level 
rank,ng process could be started with an empty set and then testing the top 
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several (x) variables from each of the two sets of ranking. This second level 
ranking process uses the network training procedure ( Fio. 3A, Step I) on a 
currently selected partition or subset of variables from the available data to train 
a set of neural networks. The ranking process is a network training procedure 
using the current set of "important" variables (which generally will initially be 
empty) plus the current variable being ranked or tested for ranking, and uses a 
greedy algorithm to optimize the set of input variables by myopically optimizing 
the input set based upon the previously identified important variable(s), to 
Identify the remaining variable(s) which improve the output the most. 

Please amend the paragraphs beginning on page 38, line 9, through page 
40. line 2, as follows: 

A number of other commercially available neural network computer 
programs may be used to perform any of the above operations, including 
tBrainmaker^ Brainmaker- , which is available from California Scientific Software 
Co., Nevada Adaptive Solutions, Beaverton, OR; Neural Network 
[UtilityCTUtility/2-, from NeuralWare, Inc., Pittsburgh, PA; 
[NeuroSheiri NeuroSheir and [NeuroWindows^l NeuroWindnw^ from Ward 
Systems Group, Inc., Frederick, MO. Other types of data mining tools, /.*., 
decision-support systems, that will provide the function of variable selection and 
network optimization may be designed or other commercially available systems 
may be used. For example, NeuroGenetic Optimizer" from BioComp Systems, 
Inc., Redmond, WA; and-Neuro Forecaster/GENETICA, from New Wave 
Intelligent Business Systems (NIB5) Pte Ltd., Republic of Singapore, use genetic 
algorithms that are modelled on natural selection to eliminate poor-performing 
nodes within network population while passing on the best performing rates to 
offspring nodes to "grow" an optimized network and to eliminate input variables 
which do not contribute significantly to the outcome. Networks based on 
genetic algorithms use mutation to avoid trapping in local minima and use 
crossover processes to introduce new structures into the population. 
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Knowledge discovery in data (KDD) is another data mining tool, decision- 
support system, designed to identify significant relationship is that exist among 
variables, and are useful when there are many possible relationships. A number 
of KDD systems are commercially available including fDarw?n""1 Darwin". from 
Thinking Machines, Bedford, MA; [Mineset^ Mineset" . from Silicon Graphics, 
Mountain View, CA, and [Eikoplex^ Eikoolex- from Uhragem Data Mining 
Company, San Francisco, CA. [(Eikopiexn tEikoplex' h as been used to provide 
classification rules for determining the probability of the presence of heart 
disease.) Others may be developed by those of skill in the art. 

Proceeding with the ranking procedure, if, for example, x is set to 2, then 
the top two variables from each of the two ranking sets will be tested by the 
process (Fig. 3A, Steps L, S), and results are checked to see if the test results 
show improvement (FJg^ Step T). If there is an improvement, the single best 
performing variable is added to the set of "important" variables, and then that 
variable is removed from the two rankings (Fig, 3B, Step U) for further testing 
(fiiSA, Step S). If there is no improvement, then the process is repeated with 
the next x variables from each set until an improvement is found or all of the 
variables from the two sets have been tested. This process is repeated until 
e-ther the source sets are empty, i.e., all relevant or important variables have 
been included in the final network, or all of the remaining variables in the sets 
bemg tested are found to be below the performance of the current list of 
•mportant variables. This process of elimination great.y reduces the number of 
subsets of the available variables which must be tested in order to determine the 
set of .mportant variables. Even in the worst case,with ten available variables 
the process would test only 34 subsets where x^2 and only 19 subsets of the 
1024 possible combinations if x = 1. Thus, where there are 100 available 
vanables, only 394 subsets would be tested where x- 2. The variables from 
the network with the best test performance are thus identified for use (Fig 3B 
Step V). a " ' 
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Please amend the paragraph on page 48, lines 3-17, as follows: 

S. Method for evaluating the effectiveness of a diagnostic test course of 
[treatmenttl treatment 

Typically, the effectiveness or usefulness of a diagnostic test is 
determined by comparing the diagnostic test result with the patient medical 
condition that is either known or suspected. A diagnostic test is considered to 
be of value if there is good correlation between the diagnostic test result and 
the patient medical condition; the better the correlation between the diagnostic 
test result and the patient medical condition, the higher the value placed on the 
effectiveness of the diagnostic test. In the absence of such a correlation, a 
diagnostic test is considered to be of lesser value. The systems provided 
herein, provide a means to assess the effectiveness of a biochemical test by 
determining whether the variable that corresponds to that test is an important 
selected variable. Any test that yields data that improves the performance of 
the system is identified. 

Please amend the paragraphs beginning on page 50. line 10, through 
page 51, line 24, as follows: 
Endometriosis 

The methods described herein, have provided a means to develop a non- 
invasive methodology for the diagnosis of endometriosis. In addition, the 
methods described herein provide means to develop biochemical tests that 
provide data indicative of endometriosis, and also to identify and develop new 
biochemical tests. 

The methodology for variable selection and use of decision-support 
systems, has been applied to endometriosis. A decision-support system, in this 
instance, a consensus of neural networks, has been developed for the diagnosis 
of endometriosis. In the course of this development, which is detailed in the 
EXAMPLES, it was found that it was possible to develop neural networks 
capable of aiding in the diagnosis of endometriosis that only rely on patient 
historical data, i.e.. data that can be obtained from a patient by questionnaire 
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format. It was found that biochemical test data could be used to enhance the 
performance of a particular network, but it was not essential to its value as a 
diagnostic tool. The variable selection protocol and neural nets provide a means 
to select sets of variables that can be inputted into the decision-support system 
to provide a means to diagnose endometriosis. While some of the identified 
[variables JyadaJaiej Include those that have traditionally been associated with 
endometriosis, others of the variables have not. In addition, as noted above, 
variables, such as pelvic pain and dysmenorrhea that have been associated with 
endometriosis are not linearly correlated with it to permit diagnosis. 

Exemplary decision-support system are described in the 
[Example] Examples . For example, one neural net, designated pat07 herein, is 
described in Example 14. Comparison of the outputof the pat07 network 
output with the probability of having endometriosis yields a positive correlation 
(see Table 1). The pat07 network can predict the likelihood of a woman having 
endometriosis based on her pat07 score. For example, if a woman has a pat07 
score of 0.6, then she has a 90% probability of having endometriosis; if her 
pat07 score is 0.4, then she has a 10% probability of having endometriosis. 
The dynamic range of pat07 output when applied to the database was about 
0.3 to about 0.7. Theoretically, the output values can range from 0 to 1, but 
values below 0.3 or above 0.7 were not observed. Over 800 women have been 
evaluated using the P at07 network, and its performance can be summarized as 
follows: 

TABLE 1 



Pat07 Score 


Endometriosis (% of Total) 


< 0.40 


10 


0.40 - 0.45 


30 


0.45-0.55 


50 


0.55 - 0.60 


70 


> 0.60 


90 
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Please amend the paragraphs beginning on page 54, line 10, through 
page 55. line 11, as follows: 

As described in the Examples, the pat07 is not the only network that is 
predictive of endometriosis. Other networks, designated pat08 through pat23a 
have been developed. These are also predictive of endometriosis. All these 
networks perform very similarly, and can readily be used in place of pat07. 
Thus, by following the methodology used to develop pat07, other similarly 
functioning neural nets can be and have been developed. Pat08 and pat09 are 
the most similar to [pat07:]pat07; these networks were developed by following 
the protocol outlined above, and were allowed to select important variables from 
the same set as that used for development of pat07, 

It was found that the initial weighting of variables can have effects on the 
outcome of the variable selection procedure, but not in the ultimate diagnostic 
result. Pat08 and pat09 used the same database of patient data as pat07 to 
derive the disease relevant parameters. Pat10 through pat23a were training 
runs originally designed to elucidate the importance of certain parameters: 
history of endometriosis, history of pelvic surgery, dysmenorrhea and pelvic 
pain. For development of these, the importance of a variable was assessed by 
withholding that variable from the variable selection process. It was found that 
in the variable selection process and training the final consensus networks, 
network performance did not significantly deteriorate. 

Thus, although a particular variable, or set of variables, may have 
appeared to be significant in predicting endometriosis, networks trained in the 
absence of such variables do not have a markedly reduced ability to predict 
endometriosis. These results [to] demonstrate (1 ) the effectiveness of the 
methodology for variable selection and consensus network [training;. I trainino 
and (2) the adaptability of networks in general. In the absence of one type of 
data, the network found other variable(s) from which to extract that information. 
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In the absence of one variable, the network selected different variables in its 
place and maintained performance. 

Please amend the paragraph on page 57, lines 18-28, as follows: 
Determination of impending birth is of importance, for example, for 
increasing neonatal survival of infants born before 34 weeks. The presence of 
fetal fibronectin in secretion samples from the vaginal cavity or the cervical 
canal from a pregnant patient after week 20 of pregnancy is associated with a 
risk of labor and delivery before 34 weeks. Methods and kits for screening for 
fetal fibronectin in body fluids and tissues, particularly in secretion samples from 
the vaginal cavity or the cervical canal, of a pregnant patient after week 20 of 
pregnancy are available (see, U.S. Patent Nos. 5,516,702, 5,468,619, and 
5,281,522, and 5,096,830; see, also U.S. Patent Nos. 5,236,846, 5,223,440, 
5,185,270). 

Please amend the paragraphs beginning on page 73. line 13, through 
page 74, line 29, as follows: 

Table 1. Performance comparison of Qualitative fFN ELISA Test and the 
Preterm Delivery Risk Assessment Software Test relative to risk of preterm 
delivery before 35 completed weeks of gestation. The Risk Assessment 
Software combines fFN ELISA Test results with patient history and symptom 
information to provide a more accurately [assessl assessed risk of preterm 
delivery (before 35 completed weeks of gestation). 

Table 2 compares the performance of the two tests relative to risk of 
preterm delivery within 7 days. The largest difference between the two tests is 
in the reduction of false positive test results of the software when compared to 
the ELISA test. The software decreased the number of false positive 
observations from 129 to 57, or about 56%. Accompanying this decrease In 
false positive results is the matching [increasingjjncrease in true negative 
results, from 611 to 683. The true positive and false negative results remained 
essentially unchanged. The sensitivity and specificity of the software test is 
much higher than the ELISA test. Compare the sensitivity of 91.3% for the 
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software with 87.0% for the ELISA, and the specificity of 92.3% for the 
software with 92.3% for the EUSA. Furthermore, the software test doubles the 
positive predictive value, increasing form 13.4% to 26.9%. Finally, the odds 
ratio is quadrupled and the relative risk more than tripled by the software. 



MEASURE 


QUAL fFN ELISA TEST 


RISK ASSESSMENT 
SOFTWARE TEST 


True Positive 


20 


21 


False Positive 


129 


57 


True Negative 


611 


683 


False Negative 


3 


2 


Sensitivity 


87.0% 


91.3% 


Specificity 


82.6% 


92.3% 


PosPV 


13.4% 


26.9% 


Neg PV 


99.5% 


99.7% 


Odds Ratio 


31.6 


125.8 


Relative Risk 


27.4 


89.7 



Table 2. Performance comparison of Qualitative fFN ELISA Test and the 
Preterm Delivery Risk Assessment Software Test relative to risk of preterm 
delivery within 7 days. H 

Table 3 compares the performance of the two test relative to risk of 
preterm delivery within 14 days. Once again, the software decreases false 
positive test results when compared to the ELISA test, from 124 to 55, or about 
53%. This decrease in false positive results is matched by the corresponding 
increase in true negative results, from 609 to 678. The number of true positive 
and false negative results were unchanged. [WhilstlWhjje. the sensitivity of the 
test was unaffected, the specificity of the test rose nearly 10 points, increasing 
from 83.1 % to 92.5%. As seen before, the positive predictive value nearly 
doubled, increasing from 16.8% to 31.3%. and the odds ratio and relative risk 
increased substantially from 24.6 to 61.6 and from 20.7 to 44.7, respectively. 
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Please amend the paragraph on page 77, lines 13-26, as follows: 
Methodology Used 

The most commonly used method for determining the importance of 
variables is to train a neural network on the data with all the variables included. 
Using the trained network as the basis, a sensitivity analysis is performed on the 
network and the training data. For each training example the network is run in 
the forward mode (no training). The network outputs were recorded. 
(TheJJJien for each input variable, the network is rerun with the variable 
replaced by [it'sjits average value over the training example. The. difference in 
output values is squared and accumulated. This process is repeated for each 
training example. The resulting sums are then normalized so that the sum of the 
normalized values equals the number of variables. In this way, if all variables 
contribute equally to the output, their normalized value would be 1 .0. The 
normalized value can then be ranked in order of importance; 

Please amend the paragraphs on page 81 , lines 13-26, as follows: 
Methodology Used 

When dealing with small training examples, the holdout method is 
effective in providing test information useful in determining network 
configuration and parameter settings. In order to maximize the data available 
for training without a big increase in processing [timeltime. a 20% holdout was 
used instead of the proposed 25%. This produced 5 partitions of the data 
instead of 4 and made 80% of the data for training in each partition. 

To minimize the effects of the random starting weights, several networks 
were trained in the full training runs. In [theseslthese runs three networks were 
trained in each of the five partitions of data, each from a different random start. 
The outputs of the networks were averaged to form a consensus result that has 
a lower variance than could be obtained from a single network. 
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Please amend the paragraph on page 83, lines 17-31, as follows: 

EXAMPLE 3 
Preprocessing and input Western Blot Data 
Requirements 

The antigen data, from Western Blots, for the patients that was originally 
delivered to Logical Designs provided information on only the peak molecular 
weights and their associated intensities. Analysis of this data and of the original 
images from which the data was taken, suggests that it may be possible to use 
the original image digitized in a way that could provide more information to the 
neural network. In examining the original images for two experiments, [it] 
preprocessing of the image data decreases the variability of the position of a 
specific molecular weight in the image. This preprocessing will use a polynomial 
fit through the standards image to produce a modified image. Preprocessing of 
the images will also include steps to normalize the background level and 
contrast of the images. 

Please amend the paragraph on page 85, lines 24-29. as follows: 
The weights of the network were then averaged together to generate a 
consensus value for each weight. Since the interconnection weight from the 
hidden (clementlejement to the output could be either positive or [negative. 
Thelnegative, the weights were transformed so that all the output connections 
had the same sign. The weights were then averaged and the results plotted 
using Excel. 

Please amend the paragraph on page 86, lines 8-24, as follows: 
The plot of the sensitivity analysis and of the weights for the final 
consensus networks indicated that there are regions on the Western Blot that 
can aid in the prediction and diagnosis of the disease. The width of the regions 
of positive and negative correlation, as seen in the network weights, also 
indicates that the results shown are significant. If the peaks had been very 
narrow, one would have to conclude that the peaks were artifacts of the 
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training process, similar to overtraining, and not [form! from the underlying 
process being learned. The regions that appear important are as follows: 
Positive Correlation: 

31503.98-34452.12 
62548.87 - 65735.97 
84279.36 - 89458.49 
Negative Correlation: 

19165.9 - 20142.47 
50263.36- 53352.14 
67725.77 ■ [7861 4.77] 7861 4.77. 
Please amend the paragraph on page 87, lines 4-18, as follows: 

EXAMPLE 4 

Investigate Fixed Input Dimension for Western Blot Data 
Requirements 

Using peak molecular weights extracted from. the preprocessed image, 
methods to reduce the varying dimension of the western blot data for a patient 
to a fixed dimension for the neural network will be investigated. This approach 
is desirable in that it will have substantially fewer network inputs than the full 
image approach. The basic problem is that the test yields a varying number of 
molecular weights that might by interrelated. Comparison of results from 
[ExamplelExample 5 and this example will indicate that patterns of molecular 
weights exist or if the weights are unrelated. Since there is some variability in 
the weight [datajdata^ the approach to the processing of this data will be similar 
to a fuzzy membership function, even though the classification will be 
performed with a neural network. 

Please amend the paragraph beginning on page 88, line 22. through page 
89, line 2, as follows: 

In order to test any of the data files produced by the above programs, the 
holdout methods was used with 80% of the data being used for training and the 
remaining 20% to be used for testing. Once the training data are produced form 
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the Western Blot data, a random number column and the Patient ID column was 
added in the Excel spreadsheet. The data was then sorted on the random 
number column. This in effect shuffles the data. In this way, it is likely that 
each partition has examples from each of the gels. With these percentages, five 
separate training and test files were produced [oslso as to allow a network 
performance to be estimated from the combined test set results. 

Please amend the paragraph beginning on page 101, line 23, through 
page 102, line 2, as follows: 
Methodology Used 

Training examples were built for the Stage desired output and the AFS 
score desired output. There were 7 patients missing Stage information and 28 
patients missing Score information. For the stage variable, the average value of 
2.09 was used where the data [was] were missing. For score, the missing data 
[waslwere replaced with a value depending on the value of the stage variable. 
For stage 1, a score of 3 was used. From stage 2, 10.5 was used. For stage 
3, 28 was used and for stage 4 the value 55 was used. 

Please amend the paragraph beginning on page 115, line 23, through 
page 116, line 6, as follows: 
Conclusions 

The variable selection process appears [worksi te work well and has 
produced two alternative networks that work as well or better than the pat07 
nets. The reason for this conclusion is that the performance statistics generated 
only on the training data appear slightly better for the pat07 [thenjthan pat08 
and pat09. Since the variable selection process carefully picks variables based 
on test set performance, the associated networks are not likely to have been 
overtrained. As a network becomes [overtrained 1 o vertrained , the typical 
characteristic is that the training example performance [increasesl increases. and 
the test set performance decreases, [Thusl Thus. the higher performance of 
pat07 may be the result of slight overtraining. 
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Please amend the paragraphs on page 124, lines 7-26; as follows: 
Results 

The count of variables found in the reduced subset run was smaller than 
that for the runs on the full training example. The typical performance of a 
consensus of networks was estimated using the holdout method with a partition 
of 5. The typical classification performance for the AFS run using the full 
training example was 77.22549%. The typical classification performance on 
the endo present subset was 63.008621 %. If all examples were classified as 
negative, the performance for the full training example would be 78.82% and 
65.29% for the subset. By changing the cutoff values for positive and negative 
classification better performance than suggested by these numbers can be 
achieved. 
Conclusions 

The results of the variable selection runs for the full training example and 
the subset of endo present examples [suggestslsjjgjgssi that the size of the 
training example is of importance in the determination of the important 
variables. It is clear that as the size of the training example increases, more 
variables will be considered important. This result can also be interpreted as an 
indication that more training data will improve the variable selection process and 
also the overall performance of the consensus networks used in building the 
diagnostic test. 

Please amend the paragraph beginning on page 125, line 24, through 
page 129, line 10, as follows: 
A. Variables 

The following are variables based on patient input data. Neural networks 
using all or selected subsets of these variables may be generated. 
Combinations of at least three of these variables may be used in conjunction 
with decision-support systems, particularly neural nets to predict risk of preterm 
delivery or impending delivery. The inputs for the variables are either yes, no, 
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no answer, or a text input, such as age. The variables, listed by type are as 
follows: 

1 Age 

Ethnic origin variables: 

2 EthOrgl: Caucasian; 

3 EthOrg2: Black; 

4 EthOrg3: Asian; 

5 Eth0rg4: Hispanic; 

6 EthOrg5: Native American; and 

7 EthOrgS: Other than the above. 
Marital status variables: 

8 MarStl: Single; 

9 MarSt2: Married; 

10 MarSt3: Divorced/Separated; 

1 1 MarSt4: Widowed; 

12 MarStS: Living with partner; or 

1 3 MarSt6: Other than those listed above. 
Education variables: 

14 EduO: Unknown; 

15 Edu1: <High School; 

16 Edu2: High School Graduate; or 

1 7 Edu3: College/trade. 
Patient complaint variables: 

18 PATIENT COMPLAINT #1 Patient has Uterine Contractions with 
or without pain; 

19 PATIENT COMPLAINT #2 Patient has intermittent lower 
abdominal pain, dull, low backache, pelvic pressure; 

20 PATIENT COMPLAINT #3 Patient has bleeding during the 
second or third trimester; 

21 PATIENT COMPLAINT #4 Patient has menstrual-like or intestinal 
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cramping? 

22 PATIENT COMPLAINT #5 Patient has change in vaginal 
discharge or amount, color, or consistency; or 

23 PATIENT COMPLAINT #6 Patient is not "feeling right". 
Variables from physician tests and assessments: 

24 Pooling refers to visual assessment to determine whether 
amniotic fluid has leaked into the vagina (see, e.g., Chapter 36, Section 18, p. 
657 in Maternal Fetal Medicine: Principle and Practice, 2nd Edition, Creasy, 
R.F. eta/., eds., W.B. Saunders & Co. 0989)); 

25 Ferning refers to the results of a test to detect the pattern 
formed when amniotic fluid is present in a cervical sample smeared on a clean 
slide and allowed to air dry (see, e.g., Chapter 36, Section 18, p. 657 in 
Maternal Fetal Medicine: Principle and Practice, 2nd Edition, Creasy, R.F. eta/., 
eds., W.B. Saunders & Co. (1989)}; 

26 Nitrazine refers to results from a known test used to measure 
the pH of amniotic fluid that has leaked into the vagina (see, e.g., Chapter 36, 
Section 18, p. 657 in Maternal Fetal Medicine: Principle and Practice, 2nd 
Edition, Creasy, R.F. eta/., eds., W.B. Saunders & Co.. (1989)); 

27 estimated gestational based (EGA) on last period (LMP); 

28 EGA by sonogram (SONO); 

29 EGA by Best-EGA is the best of the EGA by SONO and EGA by 
LMP determined as follows: 

if EGA by SONO is < 13 weeks, then EGA best is 
EGA SONO; 

if the difference by EGA by LMP and EGA by SONO is 
> 2 weeks, then EGA best is EGA by SONO; otherwise EGA 
best is EGA by LMP; 

30 EGA at Sampling refers to the EGA when fFN sampled; 
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31 CD INTERP. which refers to cervical dilatation (interpreted 
values - i.e. based on physicians estimates) where the number will be between 
0 and 10 cm and is determined from the physicians response; 

32 Gravity, which refers to the number of time woman has been 

pregnant; 

33 Parity-term, which refers to the number of term births; 

34 Parity-preterm, which refers to the number of preterm births; 

35 Parity-abortions, which refers to the number of pregnancies 
ending in spontaneous or elective abortions; 

36 Parity-living, which refers to the number of living children; 

37 Sex within 24 hrs prior to sampling for fFN; 

38 Vaginal bleeding at time of sampling; 

39 Cervical consistency at time of sampling; and 

40 UC INTERP, which refers to uterine contractions per hour as 
interpreted by the physician. 

[ComplicationsJ Complicatinns- 

41 0 COMP No previous pregnancies; 

42 1 COMP have had at least one previous pregnancy without 
complications; 

43 2nd comp atleast one preterm delivery {delivery prior to 35 

weeks); 

44 3rd comp at least one previous pregnancy with a premature 
rupture of membrane (PROM); 



cervix; 



45 4th comp at least one previous delivery with incompetent 



46 [5]5th COMP at least on previous pregnancy with pregnancy 
induced hypertension (PIH)/preeclampsia; 

47 f6]6th COMP at least one previous pregnancy with spontaneous 
abortion prior to 20 weeks; 
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48 OTHER COMP at least one previous pregnancy with a 
complication not listed above; and 

49 RESULT - fFN ELISA qualitative test result (if positive value is 1, 
if negative value is 0). 

Please amend the paragraphs on page 129, lines 14-29, as follows: 

B. A first set of neural networks demonstrating that the methods herein can 
be used to predict pregnancy related events 

EGA1-EGA4 

For these nets the preterm delivery is defined as less than or equal to 34 
weeks, 0 days. The other nets herein (described below) define preterm delivery 
as less than or equal 34 weeks, 6 days. 

Data [waslwere collected from the over 700 test patients involved in a 
clinical trial of the assay described in U.S. Patent No. 5,468,619. Variable 
selection was performed without fetal fibronectin (fFN) test data. The final 
networks, [designate EGA1 -EGA41desionated era 1 -Fn M , Were trained with 
the variables set forth in the table below. 

EGA1 - EGA4 represent neural networks used for variable selection. For 
EGA1, the variable selection protocol was performed on a network architecture 
with 8 inputs in the input layer, three processing elements in the hidden layer, 
and one output in the output layer. EGA2 is the same as EGA1, except that it is 
9 inputs in the input layer. 

Please amend the paragraphs beginning on page 140, line 9, through 
page 141, line 5. as follows: 

D. Neural network prediction of risk of delivery within 7 days- EGAD 7 
and EGAD7F 

1, Variable selection 
Using the same database described above for EGA1-EGA6, the variable 
selection protocol was applied to prediction of the risk for delivery within 7 days 
of sampling for the fFN test. As noted above for EGA 5 and EGA6, the variable 
selection procedure was applied in the absence of the fFN test result. 
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Application of the variable selection procedure to the 48 variables resulted In 
selection of the following variables: 

1- Ethnic Origin I: [Caucasionl Caucasian {he., yes or no); 

2. Uterine contractions with or without pain {i.e., yes or no); 

3. Parity-abortions; 

4.. Vaginal bleeding at time of sampling; 

5. Uterine contractions per hour; 

6. No previous pregnancies. 
2. Neural nets 

Using these variables two consensus networks were trained. One, 
designated [EGAD7]EGAD]L. was trained without including the results of the fFN 
ELISA test result, and the other, designated EGAD7f, was trained with the 
results of the fFN ELISA test result. 

Please amend the paragraph on page 146 f lines 4-8, as follows: 

EGAD7 is a set of 8 consensus networks trained similarly to EGAD7f, 
except that the input variables did not include the variable representing the 
result of the fFN ELISA test. This network can be used as a point of care 
application to give immediate [resultjresults to the clinician rather than the 24 to 
48 hours required to process the fFN sample. 

Please amend the paragraph beginning on page 153, line 1, through page 
154, line 5, as follows: . 

EXAMPLE 14 

Training of Consensus Neural Networks on Specific subsets of Pat07 Variables 

The example shows the results of a task designed to quantitate the 
contribution of pat07 variables to pat07 [performance.l oerformance and to 
develop endometriosis networks using minimal numbers of pat07 variables. 
Tasks: 

1 . Train final consensus networks using the following combination of Pat07 
variables: 

a. All 14 minus Hx Endo {13 variables total) 
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b. All 14 minus pelvic pain {13 variables total) 

c. All 14 minus dysmenorrhea (13 variables total} 

d. All 14 minus pelvic surgery (13 variables total) 

2. Train final consensus networks using other combinations of Pat07 
variables. 

a. Hx Endo, pelvic pain, and dysmenorrhea 

b. Hx Endo, pelvic pain, dysmenorrhea and Hx pelvic 
surgery 

3. Train final consensus networks using other combinations of pat07 
variables as indicated from above results. 

Please amend the paragraph beginning on page 156, line 23, through 
page 157, line 5, as follows: 

Referring to. Fig.. 10, a final indicator pair C, D is based on an analysis of a 
consensus of preliminary indicator pairs from a plurality, specifically eight, 
trained neural networks 10A - 10H (Fig. 10). Each preliminary indicator pair A, 
B is provided to one of two consensus processors 150, [152.J152 via paths 
133-140 and 141-148. The first consensus processor 150 processes all 
positive indicators. The second consensus processor 152 processes all negative 
indicators. Each consensus processor 150, 152 is an averager, i.e., it merely 
forms a linear combination, such as an average, of the collection of like 
preliminary indicator pairs A, B. The resultant confidence indicator pair is the 
desired result, where the inputs are the set of clinical factors for the patient 
under test. 

IN THE ABSTRACT: 

Please amend the abstract as follows: 

Please amend the paragraph on page 168, lines 1-5, as follows: 

ABSTRACT 

Computer systems and methods for diagnosing endometriosis and for 
assessing the risk of delivery within selected time period after performing a test 
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to assess the [ristjnsk of preterm delivery or before thirty-five weeks of 
gestation are provided. 

IN THE CLAIMS: 

Please amend claim 5 as follows: 

5. (Amended) The system of claim 3, wherein the time period is 
within [or] fourteen days of performing a biochemical test to measure fetal 
fibronectin. 
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